Transforming Examples into Patterns for Information Extraction

نویسندگان

  • Roman Yangarber
  • Ralph Grishman
چکیده

Information Extract ion (IE) systems today are commonly based on pat tern matching. The pat terns are regular expressions stored in a customizable knowledge base. Adapting an IE system to a new subject domain entails the construction of a new pat tern base a t ime-consuming and expensive task. We describe a s trategy for building pat terns from examples. To adapt the IE system to a new domain quickly, the user chooses a set of examples in a training text, and for each example gives the logical form entries which the example induces. The system transforms these examples into pat terns and then applies meta-rules to generalize these patterns.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning information extraction patterns from examples

Abs t r ac t . A growing population of users want to extract a growing variety of information from on-line texts. Unfortunately, current information extraction systems typically require experts to hand-build dictionaries of extraction patterns for each new type of information to be extracted. This paper presents a system that can learn dictionaries of extraction patterns directly from user-prov...

متن کامل

Learning information extraction patterns from examples

A growing population of users want to extract a growing variety of information from on-line texts. Unfortunately, current information extraction systems typically require experts to hand-build dictionaries of extraction patterns for each new type of information to be extracted. This paper presents a system that can learn dictionaries of extraction patterns directly from user-provided examples o...

متن کامل

Feature selection using genetic algorithm for classification of schizophrenia using fMRI data

In this paper we propose a new method for classification of subjects into schizophrenia and control groups using functional magnetic resonance imaging (fMRI) data. In the preprocessing step, the number of fMRI time points is reduced using principal component analysis (PCA). Then, independent component analysis (ICA) is used for further data analysis. It estimates independent components (ICs) of...

متن کامل

A Logical Framework for Template Creation and Information Extraction

Information extraction is the process of automatically identifying facts of interest from pieces of text, and so transforming free text into a structured database. Past work has often been successful but ad hoc, and in this paper we propose a more formal basis from which to discuss information extraction. We introduce a framework which will allow researchers to compare their methods as well as ...

متن کامل

Extraction-Stripping Patterns during Co-Extraction of Copper and Nickel from Ammoniacal Solutions into Emulsion Liquid Membranes Using LIX 84I®

Extraction of nickel and its co-extraction with copper from ammoniacal media into emulsion liquid membrane systems (ELMs) was investigated using LIX 84I as the carrier. Measurement of the solute stripped in the internal phase of emulsion opened a new dimension in the study of the ELM extraction processes. The effect of operating parameters such as feed pH, initial feed concentration, and treat ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998